Use vectorised [`str`](http://pandas.pydata.org/pandas-docs/stable/api.html#string-handling) methods to slice each string entry

    In [11]:
    d[&#39;Report Number&#39;] = d[&#39;Report Number&#39;].str[3:]
    d

    Out[11]:
         Name Report Number
    0  George       1234567
    1    Bill       9876543
    2   Sally       4434555

It is worth noting Pandas &quot;vectorised&quot; `str` methods are no more than Python-level loops.

Assuming clean data, you will often find a list comprehension more efficient:

    # Python 3.6.0, Pandas 0.19.2

    d = pd.concat([d]*10000, ignore_index=True)
    
    %timeit d[&#39;Report Number&#39;].str[3:]           # 12.1 ms per loop
    %timeit [i[3:] for i in d[&#39;Report Number&#39;]]  # 5.78 ms per loop

Note these aren&#39;t equivalent, since the list comprehension does not deal with null data and other edge cases. For these situations, you may prefer the Pandas solution.

I am working with `angular2-google-maps` and latest version of Angular2. I am trying to convert some of the local map component functions into services in their own file `maps.service.ts`. For example:

map.component.ts

    getGeoLocation(lat: number, lng: number) {
    if (navigator.geolocation) {
        let geocoder = new google.maps.Geocoder();
        let latlng = new google.maps.LatLng(lat, lng);
        let request = { latLng: latlng };
        geocoder.geocode(request, (results, status) =&gt; {
          if (status == google.maps.GeocoderStatus.OK) {
            let result = results[0];
            if (result != null) {
              this.fillInputs(result.formatted_address);
            } else {
              alert(&quot;No address available!&quot;);
            }
          }
        });
    }
    }

Into something like: `maps.service.ts`

    getGeoLocation(lat: number, lng: number): Observable&lt;google.maps.GeocoderResult[]&gt; {
        let geocoder = new google.maps.Geocoder();
        let latlng = new google.maps.LatLng(lat, lng);
        let request = { latLng: latlng };
        return new Observable((observer: Observer&lt;google.maps.GeocoderResult[]&gt;) =&gt; {
            geocoder.geocode({ request }, (
                (results: google.maps.GeocoderResult[], status: google.maps.GeocoderStatus) =&gt; {
                    if (status == google.maps.GeocoderStatus.OK) {
                        observer.next(results);
                        observer.complete();
                    } else {
                        console.log(&#39;Geocoding service failed due to: &#39; +status);
                        observer.error(status);
                    }
                }
            ));
        });
    }

The issue I&#39;m getting is that `google` variable is not being recognized when I try to use `Observer&lt;google.maps.GeocoderResult[]&gt;`. I have `declare var google: any;` outside of the service class as well.

The `google` variable works in my `map.componenet.ts` but doesn&#39;t get recognized in the `maps.service.ts`.

Angular2 Cannot find namespace &#39;google&#39;

How do I define a unique index on a combination of columns in sequelize. For example I want to add a unique index on user_id, count and name. 

    var Tag = sequelize.define(&#39;Tag&#39;, {
            id: {
                type: DataTypes.INTEGER(11),
                allowNull: false,
                primaryKey: true,
                autoIncrement: true
            },
            user_id: {
                type: DataTypes.INTEGER(11),
                allowNull: false,
            },
            count: {
                type: DataTypes.INTEGER(11),
                allowNull: true
            },
            name: {
                type: DataTypes.STRING,
                allowNull: true,
            })

How to define unique index on multiple columns in sequelize

I have a Python dataframe with about 1,500 rows and 15 columns. With one specific column I would like to remove the first 3 characters of each row. As a simple example here is a dataframe:

    import pandas as pd

    d = {
        &#39;Report Number&#39;:[&#39;8761234567&#39;, &#39;8679876543&#39;,&#39;8994434555&#39;],
        &#39;Name&#39;         :[&#39;George&#39;, &#39;Bill&#39;, &#39;Sally&#39;]
         }

    d = pd.DataFrame(d)

I would like to remove the first three characters from each field in the `Report Number` column of dataframe `d`.

Remove first x number of characters from each row in a column of a Python dataframe

I have a Python dataframe with about 1,500 rows and 15 columns. With one specific column I would like to remove the first 3 characters of each row. As a simple example here is a dataframe:
<pre><code class="hljs language-java">import pandas as pd

d = {
 'Report Number':['8761234567', '8679876543','8994434555'],
 'Name' :['George', 'Bill', 'Sally']
 }

d = pd.DataFrame(d)
</code></pre>
I would like to remove the first three characters from each field in the <code>Report Number</code> column of dataframe <code>d</code>.

How can a pre-existing conda environment be updated with another .yml file. This is extremely helpful when working on projects that have multiple requirement files, i.e. `base.yml, local.yml, production.yml`, etc.

For example, below is a `base.yml` file has conda-forge, conda, and pip packages:

base.yml

    name: myenv
    channels:
      - conda-forge
    dependencies:
      - django=1.10.5
      - pip:
        - django-crispy-forms==1.6.1

The actual environment is created with:
`conda env create -f base.yml`.

Later on, additional packages need to be added to `base.yml`. Another file, say `local.yml`, needs to import those updates.

Previous attempts to accomplish this include:

creating a `local.yml` file with an import definition:

    channels:

    dependencies:
      - pip:
        - boto3==1.4.4
    imports:
      - requirements/base. 

And then run the command:
`conda install -f local.yml`. 

This does not work. Any thoughts?

How to update an existing Conda environment with a .yml file

In Python 3, to load json previously saved like this:

`json.dumps(dictionary)`

the output is something like

`{&quot;(&#39;Hello&#39;,)&quot;: 6, &quot;(&#39;Hi&#39;,)&quot;: 5}`

when I use

`json.loads({&quot;(&#39;Hello&#39;,)&quot;: 6, &quot;(&#39;Hi&#39;,)&quot;: 5})`

it doesn&#39;t work, this happens:
```none
TypeError: the JSON object must be str, bytes or bytearray, not &#39;dict&#39;
```

JSON object must be str, bytes or bytearray, not dict

Why is `x**4.0` faster than `x**4`? I am using CPython 3.5.2.

    $ python -m timeit &quot;for x in range(100):&quot; &quot; x**4.0&quot;
      10000 loops, best of 3: 24.2 usec per loop

    $ python -m timeit &quot;for x in range(100):&quot; &quot; x**4&quot;
      10000 loops, best of 3: 30.6 usec per loop

I tried changing the power I raised by to see how it acts, and for example if I raise x to the power of 10 or 16 it&#39;s jumping from 30 to 35, but if I&#39;m raising by **10.0** as a float, it&#39;s just moving around 24.1~4.

I guess it has something to do with float conversion and powers of 2 maybe, but I don&#39;t really know.

I noticed that in both cases powers of 2 are faster, I guess since those calculations are more native/easy for the interpreter/computer. But still, with floats it&#39;s almost not moving. `2.0 =&gt; 24.1~4 &amp; 128.0 =&gt; 24.1~4` **but** `2 =&gt; 29 &amp; 128 =&gt; 62`

&lt;hr&gt; [TigerhawkT3](https://stackoverflow.com/users/2617068/tigerhawkt3) pointed out that it doesn&#39;t happen outside of the loop. I checked and the situation only occurs (from what I&#39;ve seen) when the **base** is getting raised. Any idea about that?

Why is x**4.0 faster than x**4 in Python 3?

If I have a construct like this:

    def foo():
        a=None
        b=None
        c=None

        #...loop over a config file or command line options...

        if a is not None and b is not None and c is not None:
            doSomething(a,b,c)
        else:
            print &quot;A config parameter is missing...&quot;

What is the preferred syntax in python to check if all variables are set to useful values?  Is it as I have written, or another better way?

This is different from this question:
https://stackoverflow.com/questions/3965104/not-none-test-in-python ... I am looking for the preferred method for checking if many conditions are not None.  The option I have typed seems very long and non-pythonic.

What is the most pythonic way to check if multiple variables are not None?

I want to use excel files to store data elaborated with python. My problem is that I can&#39;t add sheets to an existing excel file. Here I suggest a sample code to work with in order to reach this issue

    import pandas as pd
    import numpy as np

    path = r&quot;C:\Users\fedel\Desktop\excelData\PhD_data.xlsx&quot;

    x1 = np.random.randn(100, 2)
    df1 = pd.DataFrame(x1)

    x2 = np.random.randn(100, 2)
    df2 = pd.DataFrame(x2)

    writer = pd.ExcelWriter(path, engine = &#39;xlsxwriter&#39;)
    df1.to_excel(writer, sheet_name = &#39;x1&#39;)
    df2.to_excel(writer, sheet_name = &#39;x2&#39;)
    writer.save()
    writer.close()

This code saves two DataFrames to two sheets, named &quot;x1&quot; and &quot;x2&quot; respectively. If I create two new DataFrames and try to use the same code to add two new sheets, &#39;x3&#39; and &#39;x4&#39;, the original data is lost.

    import pandas as pd
    import numpy as np

    path = r&quot;C:\Users\fedel\Desktop\excelData\PhD_data.xlsx&quot;

    x3 = np.random.randn(100, 2)
    df3 = pd.DataFrame(x3)

    x4 = np.random.randn(100, 2)
    df4 = pd.DataFrame(x4)

    writer = pd.ExcelWriter(path, engine = &#39;xlsxwriter&#39;)
    df3.to_excel(writer, sheet_name = &#39;x3&#39;)
    df4.to_excel(writer, sheet_name = &#39;x4&#39;)
    writer.save()
    writer.close()

I want an excel file with four sheets: &#39;x1&#39;, &#39;x2&#39;, &#39;x3&#39;, &#39;x4&#39;.
I know that &#39;xlsxwriter&#39; is not the only &quot;engine&quot;, there is &#39;openpyxl&#39;. I also saw there are already other people that have written about this issue, but still I can&#39;t understand how to do that.

Here a code taken from this [link][1]

    import pandas
    from openpyxl import load_workbook

    book = load_workbook(&#39;Masterfile.xlsx&#39;)
    writer = pandas.ExcelWriter(&#39;Masterfile.xlsx&#39;, engine=&#39;openpyxl&#39;) 
    writer.book = book
    writer.sheets = dict((ws.title, ws) for ws in book.worksheets)

    data_filtered.to_excel(writer, &quot;Main&quot;, cols=[&#39;Diff1&#39;, &#39;Diff2&#39;])

    writer.save()


They say that it works, but it is hard to figure out how. I don&#39;t understand what &quot;ws.title&quot;, &quot;ws&quot;, and &quot;dict&quot; are in this context. 

Which is the best way to save &quot;x1&quot; and &quot;x2&quot;, then close the file, open it again and add &quot;x3&quot; and &quot;x4&quot;?

  [1]: https://stackoverflow.com/questions/20219254/how-to-write-to-an-existing-excel-file-without-overwriting-data-using-pandas

How to save a new sheet in an existing excel file, using Pandas?

I want to convert a `ZonedDateTime` to a `String` in the format of `(&quot;dd/MM/yyyy - hh:mm&quot;)`. I know this is possible in Joda-Time other types, just using their `toString(&quot;dd/MM/yyyy - hh:mm&quot;)`....But this doesn&#39;t work with `ZonedDateTime.toString()`.

# **How can I format a `ZonedDateTime` to a `String`?**

____

EDIT:

I tried to print the time in another timezone and the result appears to be the same always: 

    ZonedDateTime date = ZonedDateTime.now();
    ZoneId la = ZoneId.of(&quot;America/Los_Angeles&quot;);
    ZonedDateTime date2 = date.of(date.toLocalDateTime(), la);
    
    // 24/02/2017 - 04:53
    System.out.println(DateTimeFormatter.ofPattern(&quot;dd/MM/yyyy - hh:mm&quot;).format(date));
    // same result as the previous one
    // 24/02/2017 - 04:53
    System.out.println(DateTimeFormatter.ofPattern(&quot;dd/MM/yyyy - hh:mm&quot;).format(date2));

And I am not in the same timezone as Los Angeles.

_____
EDIT 2:

Found how to change the timezones:

    // Change this:
    ZonedDateTime date2 = date.of(date.toLocalDateTime(), la); // incorrect!
    // To this:
    ZonedDateTime date2 = date.withZoneSameInstant(la);

How to format a ZonedDateTime to a String?

I have a string `var m = &quot;I random don&#39;t like confusing random code.&quot;` I want to delete all instances of the substring `random` within string `m`, returning string `parsed` with the deletions completed. 


The end result would be: `parsed = &quot;I don&#39;t like confusing code.&quot;`

How would I go about doing this in Swift 3.0+?

Deleting Specific Substrings in Strings [Swift]

I have two lists:

* a list of about 750K ***&quot;sentences&quot;*** (long strings)
* a list of about 20K ***&quot;words&quot;*** that I would like to delete from my 750K sentences

So, I have to loop through 750K *sentences* and perform about 20K replacements, **but ONLY if my words are actually *&quot;words&quot;* and are not part of a larger string of characters.**

I am doing this by pre-compiling my *words* so that they are flanked by the `\b` word-boundary metacharacter:

    compiled_words = [re.compile(r&#39;\b&#39; + word + r&#39;\b&#39;) for word in my20000words]

Then I loop through my *&quot;sentences&quot;*:

    import re
    
    for sentence in sentences:
      for word in compiled_words:
        sentence = re.sub(word, &quot;&quot;, sentence)
      # put sentence into a growing list

This nested loop is processing about **50 sentences per second**, which is nice, but it still takes several hours to process all of my sentences.

* Is there a way to using the `str.replace` method (which I believe is faster), but still requiring that replacements only happen at **word boundaries**?

* Alternatively, is there a way to speed up the `re.sub` method? I have already improved the speed marginally by skipping over `re.sub` if the length of my word is &gt; than the length of my sentence, but it&#39;s not much of an improvement.

I&#39;m using Python 3.5.2

Speed up millions of regex replacements in Python 3

The character &#128105;‍&#128105;‍&#128103;‍&#128102; (family with two women, one girl, and one boy) is encoded as such:

`U+1F469` [`WOMAN`](http://emojipedia.org/emoji/%F0%9F%91%A9/),  
`‍U+200D` [`ZWJ`](https://en.wikipedia.org/wiki/Zero-width_joiner),  
`U+1F469` `WOMAN`,  
`U+200D` `ZWJ`,  
`U+1F467` [`GIRL`](http://emojipedia.org/emoji/%F0%9F%91%A7/),  
`U+200D` `ZWJ`,  
`U+1F466` [`BOY`](http://emojipedia.org/emoji/%F0%9F%91%A6/)

So it&#39;s very interestingly-encoded; the perfect target for a unit test. However, Swift doesn&#39;t seem to know how to treat it. Here&#39;s what I mean:


    &quot;&#128105;‍&#128105;‍&#128103;‍&#128102;&quot;.contains(&quot;&#128105;‍&#128105;‍&#128103;‍&#128102;&quot;) // true
    &quot;&#128105;‍&#128105;‍&#128103;‍&#128102;&quot;.contains(&quot;&#128105;&quot;) // false
    &quot;&#128105;‍&#128105;‍&#128103;‍&#128102;&quot;.contains(&quot;\u{200D}&quot;) // false
    &quot;&#128105;‍&#128105;‍&#128103;‍&#128102;&quot;.contains(&quot;&#128103;&quot;) // false
    &quot;&#128105;‍&#128105;‍&#128103;‍&#128102;&quot;.contains(&quot;&#128102;&quot;) // true

So, Swift says it contains itself (good) and a boy (good!). But it then says it does not contain a woman, girl, or zero-width joiner. **What&#39;s happening here? Why does Swift know it contains a boy but not a woman or girl?** I could understand if it treated it as a single character and only recognized it containing itself, but the fact that it got one subcomponent and no others baffles me.

**This does not change if I use something like `&quot;&#128105;&quot;.characters.first!`.**

---

Even more confounding is this:

    let manual = &quot;\u{1F469}\u{200D}\u{1F469}\u{200D}\u{1F467}\u{200D}\u{1F466}&quot;
    Array(manual.characters) // [&quot;&#128105;‍&quot;, &quot;&#128105;‍&quot;, &quot;&#128103;‍&quot;, &quot;&#128102;&quot;]

Even though I placed the ZWJs in there, they aren&#39;t reflected in the character array. What followed was a little telling:

    manual.contains(&quot;&#128105;&quot;) // false
    manual.contains(&quot;&#128103;&quot;) // false
    manual.contains(&quot;&#128102;&quot;) // true

So I get the same behavior with the character array... which is supremely annoying, since I know what the array looks like.

**This also does not change if I use something like `&quot;&#128105;&quot;.characters.first!`.**

Why are emoji characters like &#128105;‍&#128105;‍&#128103;‍&#128102; treated so strangely in Swift strings?

This is my sample code:

    #include &lt;iostream&gt;
    #include &lt;string&gt;
    using namespace std;
    
    class MyClass
    {
    	string figName;
    public:
    	MyClass(const string&amp; s)
    	{
    		figName = s;
    	}
    
    	const string&amp; getName() const
    	{
    		return figName;
    	}
    };

    ostream&amp; operator&lt;&lt;(ostream&amp; ausgabe, const MyClass&amp; f)
    {
    	ausgabe &lt;&lt; f.getName();
    	return ausgabe;
    }
    
    int main()
    {
    	MyClass f1(&quot;Hello&quot;);
    	cout &lt;&lt; f1;
        return 0;
    }

If I comment out `#include &lt;string&gt;` I don&#39;t get any compiler error, I guess because it&#39;s kind of included through `#include &lt;iostream&gt;`. If I *&quot;right-click --&gt; Go to Definition&quot;* in Microsoft VS they both point to the same line in the `xstring` file:

    typedef basic_string&lt;char, char_traits&lt;char&gt;, allocator&lt;char&gt; &gt;
    	string;

But when I run my program, I get an exception error:

&gt; 0x77846B6E (ntdll.dll) in OperatorString.exe: 0xC00000FD: Stack overflow (Parameter: 0x00000001, 0x01202FC4)

Any idea why I get a runtime error when commenting out `#include &lt;string&gt;`? I&#39;m using VS 2013 Express.

Why is #include &lt;string&gt; preventing a stack overflow error here?

Trying to create a new column in the netc df but i get the warning

    netc[&quot;DeltaAMPP&quot;] = netc.LOAD_AM - netc.VPP12_AM
    
    C:\Anaconda\lib\site-packages\ipykernel\__main__.py:1: SettingWithCopyWarning: 
    A value is trying to be set on a copy of a slice from a DataFrame.
    Try using .loc[row_indexer,col_indexer] = value instead

whats the proper way to create a field in the newer version of Pandas to avoid getting the warning?

    pd.__version__
    Out[45]:
    u&#39;0.19.2+0.g825876c.dirty&#39;

Correct way to set new column in pandas DataFrame to avoid SettingWithCopyWarning

Pretty sure this is very simple.

I am reading a csv file and have the dataframe:


    Attribute    A   B   C
    a            1   4   7
    b            2   5   8
    c            3   6   9

I want to do a transpose to get

    Attribute    a   b   c
    A            1   2   3
    B            4   5   6
    C            7   8   9

However, when I do df.T, it results in

    
                 0   1   2 
    Attribute    a   b   c
    A            1   2   3
    B            4   5   6
    C            7   8   9`

How do I get rid of the indexes on top?

How do I transpose dataframe in pandas without index?

I am trying to get a new dataset, or change the value of the current dataset columns to their unique values. 
Here is an example of what I am trying to get : 

       A B
     -----
    0| 1 1
    1| 2 5
    2| 1 5
    3| 7 9
    4| 7 9
    5| 8 9
    
    Wanted Result    Not Wanted Result
           A B            A B
         -----          -----
        0| 1 1         0| 1 1
        1| 2 5         1| 2 5
        2| 7 9         2| 
        3| 8           3| 7 9
                       4|
                       5| 8

I don&#39;t really care about the index but it seems to be the problem. 
My code so far is pretty simple, I tried 2 approaches, 1 with a new dataFrame and one without. 

    #With New DataFrame
    def UniqueResults(dataframe):
        df = pd.DataFrame()
        for col in dataframe:
            S=pd.Series(dataframe[col].unique())
            df[col]=S.values
        return df

    #Without new DataFrame
    def UniqueResults(dataframe):
        for col in dataframe:
            dataframe[col]=dataframe[col].unique()
        return dataframe

I have the error &quot;Length of Values does not match length of index&quot; both times.

ValueError: Length of values does not match length of index | Pandas DataFrame.unique()

Seems pretty Googleable but haven&#39;t been able to find something online that works.

I&#39;ve tried both `sns.boxplot(&#39;Day&#39;, &#39;Count&#39;, data= gg).title(&#39;lalala&#39;)` and `sns.boxplot(&#39;Day&#39;, &#39;Count&#39;, data= gg).suptitle(&#39;lalala&#39;)`. None worked. I think it might be because I&#39;m also working with matplotlib.

How to add title to seaborn boxplot

I am using the following code to create a data frame from a list:

    test_list = [&#39;a&#39;,&#39;b&#39;,&#39;c&#39;,&#39;d&#39;]
    df_test = pd.DataFrame.from_records(test_list, columns=[&#39;my_letters&#39;])
    df_test

The above code works fine. Then I tried the same approach for another list:

    import pandas as pd
    q_list = [&#39;112354401&#39;, &#39;116115526&#39;, &#39;114909312&#39;, &#39;122425491&#39;, &#39;131957025&#39;, &#39;111373473&#39;]
    df1 = pd.DataFrame.from_records(q_list, columns=[&#39;q_data&#39;])
    df1

But it gave me the following errors this time:

    ---------------------------------------------------------------------------
    AssertionError                            Traceback (most recent call last)
    &lt;ipython-input-24-99e7b8e32a52&gt; in &lt;module&gt;()
          1 import pandas as pd
          2 q_list = [&#39;112354401&#39;, &#39;116115526&#39;, &#39;114909312&#39;, &#39;122425491&#39;, &#39;131957025&#39;, &#39;111373473&#39;]
    ----&gt; 3 df1 = pd.DataFrame.from_records(q_list, columns=[&#39;q_data&#39;])
          4 df1
    
    /usr/local/lib/python3.4/dist-packages/pandas/core/frame.py in from_records(cls, data, index, exclude, columns, coerce_float, nrows)
       1021         else:
       1022             arrays, arr_columns = _to_arrays(data, columns,
    -&gt; 1023                                              coerce_float=coerce_float)
       1024 
       1025             arr_columns = _ensure_index(arr_columns)
    
    /usr/local/lib/python3.4/dist-packages/pandas/core/frame.py in _to_arrays(data, columns, coerce_float, dtype)
       5550         data = lmap(tuple, data)
       5551         return _list_to_arrays(data, columns, coerce_float=coerce_float,
    -&gt; 5552                                dtype=dtype)
       5553 
       5554 
    
    /usr/local/lib/python3.4/dist-packages/pandas/core/frame.py in _list_to_arrays(data, columns, coerce_float, dtype)
       5607         content = list(lib.to_object_array(data).T)
       5608     return _convert_object_array(content, columns, dtype=dtype,
    -&gt; 5609                                  coerce_float=coerce_float)
       5610 
       5611 
    
    /usr/local/lib/python3.4/dist-packages/pandas/core/frame.py in _convert_object_array(content, columns, coerce_float, dtype)
       5666             # caller&#39;s responsibility to check for this...
       5667             raise AssertionError(&#39;%d columns passed, passed data had %s &#39;
    -&gt; 5668                                  &#39;columns&#39; % (len(columns), len(content)))
       5669 
       5670     # provide soft conversion of object dtypes
    
    AssertionError: 1 columns passed, passed data had 9 columns

Why would the same approach work for one list but not another? Any idea what might be wrong here? Thanks a lot!

Python: create a pandas data frame from a list

I am trying to filter the columns in a pandas dataframe based on whether they are of type date or not.  I can figure out which ones are, but then would have to parse that output or manually select columns.  I want to select date columns automatically.  Here&#39;s what I have so far as an example - I&#39;d want to only select the &#39;date_col&#39; column in this case.

    import pandas as pd
    df = pd.DataFrame([[&#39;Feb-2017&#39;, 1, 2],
                       [&#39;Mar-2017&#39;, 1, 2],
                       [&#39;Apr-2017&#39;, 1, 2],
                       [&#39;May-2017&#39;, 1, 2]], 
                      columns=[&#39;date_str&#39;, &#39;col1&#39;, &#39;col2&#39;])
    df[&#39;date_col&#39;] = pd.to_datetime(df[&#39;date_str&#39;])
    df.dtypes
Out:

    date_str            object
    col1                 int64
    col2                 int64
    date_col    datetime64[ns]
    dtype: object

How do I tell if a column in a pandas dataframe is of type datetime? How do I tell if a column is numerical?

How to get merged data frame from two data frames having common column value such that only those rows make merged data frame having common value in a particular column. 

I have 5000 rows of `df1` as format : -

    	director_name	actor_1_name	actor_2_name	actor_3_name	movie_title
    0	James Cameron	CCH Pounder	Joel David Moore	Wes Studi	  Avatar
    1	Gore Verbinski	Johnny Depp	Orlando Bloom	Jack Davenport	 Pirates 
        of the Caribbean: At World&#39;s End
    2	Sam Mendes	 Christoph Waltz	Rory Kinnear	Stephanie Sigman Spectre


and 10000 rows of `df2` as

    movieId		              genres	                    movie_title
    	1		Adventure|Animation|Children|Comedy|Fantasy	  Toy Story
    	2		Adventure|Children|Fantasy	                  Jumanji
    	3		Comedy|Romance	                           Grumpier Old Men
    	4		Comedy|Drama|Romance	                  Waiting to Exhale
    	


A common column &#39;movie_title&#39; have common values and based on them, I want to get all rows where &#39;movie_title&#39; is same. Other rows to be deleted.

Any help/suggestion would be appreciated. 


Note:  I already tried 

    pd.merge(dfinal, df1, on=&#39;movie_title&#39;)

and output comes like one row 


    director_name	actor_1_name	actor_2_name	actor_3_name	movie_title	movieId	title	genres

and on how =&quot;outer&quot;/&quot;left&quot;, &quot;right&quot;, I tried all and didn&#39;t get any row after dropping NaN although many common coloumn do exist.

Merge two data frames based on common column values in Pandas

I have the following line in a file I&#39;m editing in VSCode:

`...............111.........111.............111..`

I want to replace all `.`s with `0`s. However, when I highlight the line and do a find/replace for `.`s, *all* the `.`s in the document are replaced, not just the ones in the line I&#39;ve select, even when I toggle the &quot;Find in selection&quot; button. Is this a bug? In other editors, if I select a chunk of text and then do a find/replace, it will only find/replace matches within the selected block.

Below is a snippet that you should be able to reproduce the issue with. The `...............111.........111.............111..` line is inside the `test_unicode` function.

    def test_simple2(self):
            &quot;&quot;&quot;Simple CSV transduction test with empty fields, more complex idx, different pack_size.
    
            100011000001000 -&gt;
            ..........111....................111..........11111..........111..
            &quot;&quot;&quot;
            field_width_stream = pablo.BitStream(int(&#39;1000110001000001000&#39;, 2))
            idx_marker_stream = pablo.BitStream(int(&#39;11101&#39;, 2))
            pack_size = 4
            target_format = TransductionTarget.JSON
            csv_column_names = [&quot;col1&quot;, &quot;col2&quot;, &quot;col3&quot;, &quot;col4&quot;, &quot;col5&quot;]
    
            pdep_marker_stream = pablo.BitStream(generate_pdep_stream(field_width_stream,
                                                                      idx_marker_stream,
                                                                      pack_size, target_format,
                                                                      csv_column_names))
            self.assertEqual(pdep_marker_stream.value, 63050402300395548)
    
        def test_unicode(self):
            &quot;&quot;&quot;Non-ascii column names.
    
            Using UTF8. Hard coded SON boilerplate byte size should remain the same, column name
            boilerplate bytes should expand.
    
            100010010000000 -&gt;
            2 + 4 + 9     2 + 4 + 6     2 + 4 + 7
            ...............111.........111.............111..
            &quot;&quot;&quot;
            field_width_stream = pablo.BitStream(int(&#39;100010001000&#39;, 2))
            idx_marker_stream = pablo.BitStream(1)
            pack_size = 64
            target_format = TransductionTarget.JSON
            csv_column_names = [&quot;한국어&quot;, &quot;中文&quot;, &quot;English&quot;]
    
            pdep_marker_stream = pablo.BitStream(generate_pdep_stream(field_width_stream,
                                                                      idx_marker_stream,
                                                                      pack_size, target_format,
                                                                      csv_column_names))
            self.assertEqual(pdep_marker_stream.value, 1879277596)

I&#39;m using VSCode 1.12.2 in Ubuntu 16.04.

Find and replace in Visual Studio code in a selection

I see [how to search and replace in specific lines][1], specifying by line number, and [how to search and replace using the current line as reference to a number of lines down][2].

**How do I search and replace in the current line only?** I&#39;m looking for a simple solution that does not involve specifying line numbers as the linked solutions do.


  [1]: https://stackoverflow.com/questions/17319557/search-and-replace-in-vim-in-specific-lines
  [2]: https://stackoverflow.com/questions/18020381/vim-search-and-replace-using-current-line-as-reference-point

Content Type	Original Author	Original Content on Stackoverflow
Question	d84_n1nj4	View Question on Stackoverflow
Solution 1 - Python	EdChum	View Answer on Stackoverflow
Solution 2 - Python	jpp	View Answer on Stackoverflow

Remove first x number of characters from each row in a column of a Python dataframe

Python Problem Overview

Python Solutions

Solution 1 - Python

Solution 2 - Python

How to define unique index on multiple columns in sequelize

Angular2 Cannot find namespace 'google'

Attributions